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ABSTRACT 

Modern supernova (SN) surveys are now uncovering stellar explosions at rates that far surpass 
what the world's spectroscopic resources can handle. In order to make full use of these SN 
datasets, it is necessary to use analysis methods that depend only on the survey photometry. 
This paper presents two methods for utilizing a set of SN light curve templates to classify SN 
objects. In the first case we present an updated version of the Bayesian Adaptive Template 
Matching program (BATM). To address some shortcomings of that strictly Bayesian approach, 
we introduce a method for Supernova Ontology with Fuzzy Templates (SOFT), which utilizes 
Fuzzy Set Theory for the definition and combination of SN light curve models. For well-sampled 
light curves with a modest signal to noise ratio (S/N> 10), the SOFT method can correctly 
separate thermonuclear (Type la) SNe from core collapse SNe with >98% accuracy. In addition, 
the SOFT method has the potential to classify supernovae into sub- types, providing photometric 
identification of very rare or peculiar explosions. The accuracy and precision of the SOFT method 
is verified using Monte Carlo simulations as well as real SN light curves from the Sloan Digital Sky 
Survey and the SuperNova Legacy Survey. In a subsequent paper the SOFT method is extended 
to address the problem of parameter estimation, providing estimates of redshift, distance, and 
host galaxy extinction without any spectroscopy. 



Subject headings: methods:statistical — supernovae:general 



Just prior to the turn of the millennium, |Riess| 

~ (|1999|) used 



et al. (19981 and Perlmutter et al 



multi-epoch imaging surveys to show that the ex- 
pansion of our universe appears to be accelerat- 
ing. This extraordinary result was revealed with 
the careful analysis of just a few dozen Type la Su- 
pernovae (SNIa). In recent years, supernova (SN) 
surveys such as ESSENC E ( |Miknaitis et al.||2007| 
Wood-Vasey et al.||2007|) and SNLS QHowell et al' 



2005 Brondcr et al. 2008) have begun to probe the 



nature of this Dark Energy with samples of a few 
hundred SNIa. The next generation of SN surveys 



- such as Pan-STARRS^ LSSlf] and JDEM^] - 
will increase the survey yield by another order of 
magnitude, producing thousands of new SN detec- 
tions each year. With this dramatic step forward, 
we will move beyond the saturation point of the 
astronomical community: all the time on all the 
telescopes of the world will not be enough for spec- 
troscopic follow-up of the newly discovered SNe. 

The success of these modern SN surveys is 
strongly dependent on our ability to accurately 
and precisely evaluate a SN based on photomet- 



1 http : //pan- Starrs . if a.hawaii . edu 
2 http : / /www. lsst . org 

d http : //nasascience .nasa.gov/missions/jdem 
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ric information alone. Poznanski et al. (20021 and 



Johnson & Crotts ( 2006 ) have demonstrated that 



SN colors can be used to effectively segregate SNe 
into the broad categories of core collapse (CC) 
and thermonuclear (TN) SNe. Once an object has 
been classified as a SNIa, the shape of the light 
curve can be used to determine an accurate lu- 
minosity distance (Phillips 1993). Extensions of 



this correlation to more refined parameterizations 
of the light curve shape ( Hamuy et al.|19 96 Gold- 



haber et al. 2001 1 and to multi-color light curves 



(Jha et al. 2007 Guy et al. 20071 have been in- 



creasingly successful at measuring accurate SNIa 
luminosity distances. This combination of pho- 
tometric classification and parameter estimation 
can be executed for rolling SN searches even with 
only early photometric data, as in the SNLS sur- 
vey ( |Sullivan et aTfl2006[ ). 

The use of exclusively photometric methods for 
analyzing large SN datasets raises a number of 
concerns. Sample purity is an important issue, 
especially because SNe of Type lb and Ic are dif- 
ficult to distinguish from SNIa. This can lead to 
significant biases in SNIa-dcrived cosmological pa- 
rameter estimates (Homcicr 2005). Also, without 
a spectroscopic redshift, one must be aware of the 
covariance between estimates of redshift and dis- 
tance. When the parameters of interest are poorly 
constrained by the supernova alone, one would like 
to have a mechanism for smoothly integrating sup- 
plementary data, such as host galaxy colors and 
morphology. 

To address these concerns, one branch of light 
curve evaluation techniques have turned to the use 



of Baycsian probability theory. Tonry et al. ( 2003 1 



utilized the Bayesian Adaptive Template Match- 
ing (BATM) technique (the precursor to the pro- 
grams presented in this paper) to determine lumi- 
nosity distances by comparing observed SNIa light 
curves against a library of templates. |Kuznetsova| 
& Connolly (2007[) introduced a Bayesian classi- 



fication scheme designed to accurately determine 
SN types even with poorly sampled light curves, 



and Poznanski et al. (20071 presented the Auto- 



matic Bayesian Classifier, which uses a similar ap- 
proach to classify SNe with as little as one epoch 
of data. 



Barris & Tonry ( 2004 1 outlined one of the major 



advantages of the Bayesian approach to SN light 
curve analysis. The Bayesian formalism allows the 



user to marginalize over any model parameter that 
is not relevant to the scientific question being ad- 
dressed. For cosmological studies, one generally 
wants to compare redshift to luminosity distance, 
but for a SN rate study, the redshift may be a 
"nuisance parameter" that can be integrated away. 
In addition, the Bayesian formalism can provide 
a complete and accurate treatment of the uncer- 
tainties inherent in an imperfect classification pro- 
gram. 

In this paper we present two related frameworks 
for classifying SN light curves using templates with 
a fixed light curve shape. The first of these is a 
probabilistic, Bayesian approach, and the second 
is based on Fuzzy Set Theory. This paper is di- 
vided into 8 sections as follows: 

|T] considers the necessary elements of a SN 
light curve model, and describes how the 
template-based methods presented here are 
distinguished from other light curve fitters 
by avoiding a parameterization of the light 
curve shape. 

Sj2] explains the construction of a library of tem- 
plate SNe using photometric and spectro- 
scopic data. 

Sj3] presents BATM, the Bayesian Adaptive 
Template Matching method. 

f]4] discusses limitations of the Bayesian ap- 
proach, and introduces SOFT, the Super- 
nova Ontology with Fuzzy Templates pro- 
gram. 

Sj5] describes the use of Monte Carlo simulations 
and archival SN light curves to verify the 
accuracy of these methods. 

f|6] considers how additional models can be in- 
troduced to account for non-SN contami- 
nants and to quantify classification uncer- 
tainty. 

SjT] contains a final summary and conclusions. 

1. Model Parameters 

When we observe a SN light curve, we are col- 
lecting information about the flux as a function 
of wavelength and time, / fc s (A,i). If we were to 
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make a list of all the relevant parameters that in- 
fluence the observed flux, we could divide them up 
into a set of "physical parameters" <fr and a set of 
"location parameters" 6. For TNSNethe physical 
parameter set would include things such as the to- 
tal mass of ejecta , the mass of 56 Ni, the mctallicity 
of the progenitor, the type of companion star, the 
asphericity of the explosion, etc.: 

* = (M ej ,M m ,Z,...) 

The 3> vector could be broadened to include 
CC SNeas well by including characteristics such 
as the type of stellar explosion, the presence of 
Hydrogen or Helium envelopes, etc. If we had 
a complete model for all SN progenitor systems 
and a perfect SN explosion simulator, then we 
could define a translation function T that takes 
as input a parameter vector $ and produces a 
function describing the intrinsic SN luminosity for 
any wavelength and time: 

L(\,t)=T(&,\,t) (1) 

For both TN and CC SNe, the object's location 
in time and space distorts this intrinsic luminosity 
in a predictable way, and we observe the flux. This 
distortion is predictable because we have reason- 
ably good models describing the way SN light is 
affected by cosmological distances, dust, and time, 
so we can write the observable flux as: 

'■(£->'*-'*) ltr °'"""™ 

(2) 

where z is the redshift, is the luminosity dis- 
tance, Awn_|_ z ) describes the host galaxy extinc- 
tion, and t pk is the time of the light curve peak. 
Hereafter, we will drop the A dependency when 
discussing fluxes, under the requirement that flux 
comparisons are always to be done with fluxes 
from the same broad-band filter. Throughout this 
paper, we use a simplified formulation of the four 
"location parameters:" 

e = o, fj, e , A v ,t pk ) 

The first parameter is the redshift of the object 
(z), which determines what portion of the spec- 
tral energy distribution (SED) is observed in each 



bandpass, as well as setting the time-scale of the 
light curve due to cosmological time dilation. For 
the second parameter we have recast the luminos- 
ity distance using the parameter fj, e . We define \x & 
as the difference between the true distance modu- 
lus at a given redshift and the value expected for 
an empty universe at that zj^] 

li e = (m - M) - 51og 10 25 (3) 

The third parameter is the time of the light curve 
peak (t pk ), which simply shifts the light curve 
along the time axis. Our final parameter rep- 
resents the host galaxy extinction (Ay), and is 
discussed in detail in |2.3| The parameter "Ay" 
here is a proxy for a potentially complex extinction 
function that may vary with time and wavelength. 
Throughout this paper, we treat as a set of nui- 
sance parameters, to be marginalized over for the 
purpose of getting a broad classification into one of 
the established SN categories. The task of extract- 
ing estimates of some parameters of interest (i.e. 
measuring z and [i e for cosmology) is addressed in 
the companion paper. 

1.1. Light Curve Shape 

The core function of any SN light curve eval- 
uation program is to predict the observable flux 
from various SN types, over an (infinite) range of 
possible location parameters 6. By comparing the 
model to observations we can make statements of 
probability regarding a candidate SN's class and 
location. So how does one define the model for 
predicting light curve shape? 

The fundamental problem is that there is a hid- 
den space of physical parameters 4> that deter- 
mine SN light curve shape, and we do not have a 
theoretical model that can provide the translation 



4 Note that this formulation for the luminosity distance does 
not mean that BATM and SOFT are dependent on an as- 
sumed cosmology. In Equation [3] the distance parameter, 
fi e , quantifies the dimming (or brightening) of a SN rel- 
ative to how it would appear in an empty universe. The 
only cosmological parameter appearing here is Ho, which 
factors out as a constant term, independent of redshift z. 
Therefore, when comparing the BATM/SOFT distance es- 
timates of two SNe at different redshifts, there is no bias 
introduced by assuming a cosmology, so long as we restrict 
ourselves to relative distance estimates. 
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of <fr into luminosity as in Eq. [T] In parameter- 
ized SN light curve fitters such as MLCS, SALT, 
and SiFTO, this problem is addressed by making 
the assumption that a set of shape parameters A 
can be used as a proxy for the unobscrvablc 4> 
vector^] Parametric fitters thus produce a single 
model M(A), which they assert is capable of rep- 
resenting any possible physical parameter set <fr by 
varying the parameter (s) A to modify the shape of 
the light curve. In general, the M(A) model is re- 
stricted to TN SNe, although in principle a model 
for CC SNe could be applied, although it would be 
more complex and less informative. 

For BATM and SOFT, we take an alternative 
approach, in which we do not parameterize the 
light curve shape in order to represent all (TN)SNe 
with a single model. Instead, we use a finite set of 
discrete models, each one representing the shape of 
a single well-observed SN from the local universe. 
Whereas the parametric approach essentially tries 
to fit an empirical function across the hidden <I> 
space, with BATM we are sampling the 3> space 
at discrete locations. 

2. The Template Library 

To build up our library of SN light curve mod- 
els for comparison against SN candidates there are 
three principal stages. The first step is the collec- 
tion of a suitable range of light curves for all SN 
types. Second, we must fit a smooth interpolating 
function through the observed data. Finally, we 
use a spectral model, constrained by the photom- 
etry, to predict how each template SNe would look 
if they appeared at a different location 6. 

2.1. Low-z SN Models 

We begin by collecting light curves of low red- 
shift (z<0.1) SNe that are well-sampled with data 
in all five of the standard Johnson-Cousins band- 
passes: UBVRI. We place particular importance 
on U and B-band data, because these bands are 
redshifted into optical and near-infrared wave- 
lengths at z>0.5. We also prefer templates with 
early time photometry to constrain the rise time 
of the light curve. To avoid introducing biases 
due to uncertain distances, wherever possible we 



restrict our library to SNe that are in the Hubble 
flow (cz > 2500 km s _1 ) or have an independent 
distance determination from Cepheids or surface 
brightness fluctuations. The complete set of light 
curve templates are listed in Table [T| 

Because of their utility for cosmological stud- 
ies, the literature is replete with examples of well- 
sampled TN SN light curves. Most of the TN SN 
light curves in Table [Tjwer e acquired from the col- 



lection of Jha et al. ( 2007[ ), and their original ref- 
erences are listed in the table. For host galaxy 
extinctions and distance estimates we have used 



5 Here A represents a vector including a width parameter 
such as Amis or "stretch," and possibly also a color pa- 
rameter and other higher order terms. 



the compilation of Rcindl et al. (20051, or the 
listed reference. We have divided the TNSN into 
three sub-classes. The so-called "Branch-normal" 
Type la SNe are listed as simply "la." Overlu- 
minous SNe are grouped into the "Ia + " sub-class. 
This includes objects that have been labeled as 
SN 1991T-like and SN 1999aa-like, as well as the 
unique SN 2000cx. The "la - " sub-class includes 
underluminous objects of the SN 1991bg-like vari- 
ety, as well as the very peculiar SN 2002cx. One of 
the strengths of the template-based approach pre- 
sented here is the ability to seamlessly integrate 
very peculiar objects such as SN OOcx and 02cx 
into the light curve fitting procedure. 

For CC SNe, the pool of quality templates is 
distinctly shallower. Table [T| lists the light curve 
templates for five CCSN subclasses. Each CC 
type is a very heterogeneous set in terms of peak 
brightness, light curve shape and colors. With this 
in mind, we don't pretend that just a handful of 
CC SN light curve templates can be truly repre- 
sentative of an entire class. Nevertheless, the col- 
lection listed in Table Q] is sufficient for the mod- 
est goal of distinguishing between the CC and TN 
classes. Type lb and Ic objects are the most likely 
to be misclassified, because of their photometric 
similarity to Type la SNe. Our template library 
includes three examples of typical Ib/c SNe, as 
well as two of the very high energy "hypernova" 
type (SN 1998bw and SN 2002ap). 

With the observed multi-color light curve 
points in hand, we do not yet have a functional 
template, because the discrete observation points 
leave large gaps in time where the template flux 
is as yet undefined. To convert the observed data 
points into a continuous function we perform a 
least-squares fit with a natural cubic spline curve 
in each bandpass. These UBVRI spline curve fits 
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Table 1 
Template Light Curves 



Core Collapse 

Type SN References 



Ib/c 



lib 

Iln 

IIL 
IIP 



19941 

1998bw 

1999ex 

2002ap 

2004aw 

1993J 

1996cb 

2008ax 

1999el 

1994Y 

1998S 

1979C 

1980K 

1999em 

1999gi 

2004et 



J 

15 



29 
50 

24 
23 
32 



50 



30 



14 



11 



55 



38 



Thermonuclear 

Type SN References 



la 4 



la 



Ia" 



1990N 

1991T 

1999aa 

1999dq 

1999gp 

2000cx 

1981B 

1989B 

1994D 

1996X 

1998ab 

1998aq 

1998bu 

1999ee 

2000ca 

2000cn 

2000dk 

2001V 

2002bo 

2002er 

2005cf 

1998bp 

1998de 

1999by 

2002cx 



5 35 



4T 53 



22 



28 



17 



17 



39 35 31 



51 



IB 46 



21 



17 



33 



References. — (l) |Altavilla et al.|2004| (2) |Benetti et al.|2004| (3) |Benson et al.|l994| (4) |Buta|l982| (5) |Buta fc Turner 



|1983| (6) |Candia et al.|2003| (7) |Di Carlo et al.|2002] (8) |Elmhamdi et al.|2003| (9) |Fassia et al.|2000| (10) |Foley et al.|2003 



(11) |Gal-Yam et al.||2002| (12) |Galama et al.|[l998l (13) |Garnavich et al.pOOl] (14) |Hamuy et al.||2002| (15) |Ho et aL][200T 



(16) |Jhaet al.|1999| (17) |Jha|2002| (18) |Krisciunas et al.|2000| (19) |Krisciunas et al.|200X| (20) |Krisciunas et al.|2004] (21) [Lair| 



|et al.|20Q6| (22) |Leibundgut et al.|1991| (23) |Leonard et al.|2002] (24) |Leonard et al.|2002| (25) |Lewis et al.|1994| (26) |Li et al 
|2001| (27) |Li et al]|2003| (28) |Lira et al.|1998| (29) |Liu et aL]|2000| (30) |McKenzie fc Schaefer|1999| (31) |Meikle et al.|1996| 



(32) |Misra et aL]|2007| (33) |Modjaz et"aI1|2001| (34) |PastoreIlo et al.||2008| (35) |Patat et al~1|1996| (36) |Pignata et~al"1|2004 



(37) |Qiu et al.|1999| (38) |Richmond et al.|1994| (39) |Richmond et al.|1995| (40) |Richmond et al.|1996| (41) |Riess et al.|1999| 



(42) |Riess et al.|2005| (43) |Salvo eTaLl[200T] (44) |Sollerman et al.|2000| (45) |Stritzinger et al.|2002| (46) |Suntzeff et al.|1999] 



(47) |Szab6 et al.|[2003] (48) |Taubenberger et aT1[2006] (49) |Tsvetkoy1[T982l (50) |Tsvetkov fc Pavlyukl[T995l (51) |Vink6 eFaT. 



|2003| (52) |Wang et al.|2009| (53) |Wells et al.|1994| (54) |Yokoo et al.|1994| (55) |Yoshii et al.|2003| (56) |de Vaucouleura et"al. 
[T98T1 
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Fig. 1— Template spline fits to UBVRI light 
curves for SN1998aq are shown on the left side. 
Light curve points are shown as crosses, and the 
spline curve is given as a solid line. The light 
curves for each filter have been shifted in mag- 
nitude for visibility. Residual plots in the panels 
on the right side show the magnitude difference 
between the observed light curve points and the 
spline fit. The very small and largely uncorrelated 
residuals demonstrate that interpolation with this 
fit can accurately reproduce the true SN colors at 
times where there is no observational data. See 
the electronic edition of the Journal for a color 
version of this figure. 

provide excellent interpolation of the true light 
curve, as shown in Figure [TJfor the SN1998aq tem- 
plate. The residuals of the observations around 
the spline fits are consistent with the observational 
errors. For any point in time, the template spline 
fits provide UBVRI magnitudes with uncertainties 
on the order of 0.02 mags or less. 

2.2. Redshifting and Reverse K-correction 

We would now like to adapt these low-z UBVRI 
light curves to a grid of points in the four dimen- 
sions of location parameter space 6= (z, fi e , t pk , 
Ay), while simultaneously translating them into 
the filters used for observing the candidate object 
(for this paper we have used griz filters from the 
Pan-STARRS survey for our Monte Carlo simu- 
lations, plus the g'r'i'z' filter set from SDSS). To 
carry out this process perfectly, we would need to 
have a complete spectrum of the template object 



Fig. 2. — In the ideal scenario for a template li- 
brary, each template would have complete spec- 
trophotometric information, providing the relative 
flux value at any wavelength and any time. The 
thin light gray ribbon (orange in the electronic ver- 
sion) traces a slice across the time axis, which pro- 
vides a single spectrum from t=5 days past peak. 
The broad dark gray (blue) slice indicates the V 
bandpass. Integrating over the region where the 
t=5 ribbon overlaps the broad V band would pro- 
vide a single data point for a template light curve. 
See the electronic edition of the Journal for a color 
version of this figure. 

at all points in time (Figure |2|. This ideal sit- 
uation is unattainable, so we must approximate 
it by collecting spectra from many similar objects 
and piecing them togeth er. For the TN SNewe use 
the composi t e spec tra of |Nugent et al. ( 2002 and 
Hsiao et al.| ( |2007| p1 For CC S Ne we use the col- 



lection of spectra employed by Blondin fc Tonry| 
fl2007| )p1 

These spectral compilations provide a complete 
SED at a set of discrete points in time, but we 
want to be able to provide a template magnitude 
at any point in time. To do this, we can use 
our temporally complete photometric model (i.e. 
Figure [I]) to constrain the time evolution of our 
sparsely sampled spectral model in four steps: 



e http : //supernova, lbl .gov/~imgent/nugent .templates . 
html I 

^http : //www. astro .uvic . ca/~hsiao| 

^http : //cf a- www .harvard. edu/~sblondin/sof tware/snid 
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days from peak (obs frame) 



Fig. 3. — A subset of the family of rcdshiftcd 
light curve templates derived from the SN1998aq 
parent light curves shown in Fig. [I] and the SEDs 
shown in Fig. [2] All light curves are in the PS1 
i-band with no shifts applied for extinction. The 
light curves are spaced in redshift from z=0.1 to 
1.5 with steps of 0.1, with highlighted (red) curves 
at z=0.5, 1.0, and 1.5. Each light curve has been 
dimmed appropriately for its redshift, if it were in 
an empty universe. See the electronic edition of 
the Journal for a color version of this figure. 

1. Correct the SED for extinction. 

2. Iterative ly warp the SED until the inte- 
grated broad-band fluxes match the UBVRI 
flux values drawn from the template spline 
at the time the SED was taken. 

3. Redshift the warped SED to the desired z. 

4. Integrate the SED over the bandpass of in- 
terest to get a new broad-band magnitude. 

This process is repeated for every available 
SED, producing a new set of multicolor light curve 
points that represents what the input template 
would look like if it appeared at the given redshift 
z and were observed in the given bandpass. At the 
end of this process, we have effectively translated 
the UBVRI input data from our templates in Ta- 
ble [T] into griz points at any redshift, using the 
SED's as the bridge between. 

Now each of these high-redshift griz light curve 
models has a discrete point in each bandpass for 
every one of the 60+ spectral templates. The 



dates of observation for the candidate light curves 
will not coincide with the times of these discrete 
points, so we once again turn to spline interpo- 
lation to fill in the gaps. The same process of 
least-squares spline fitting that was used for the 
low-z UBVRI light curves is now carried out for 
each redshifted griz light curve. Figure [3] shows 
the set of spline curves created from warping the 
SN1998aq template to 15 redshifts. These result- 
ing spline curve templates are "adaptive" in that 
they can be shifted up or down to accommodate a 
different value of fi e , reddened to account for host 
Ay, and shifted left or right for different values of 

fpk- 

Note that when applying a redshift in step 3 
we avoid making any assumptions of cosmologi- 
cal parameters by applying the redshift as if the 
template SN were in an empty universe (Qm — 
il\ = 0). Because we have also defined the dis- 
tance modulus /i e relative to an empty universe 
(Eq. [3]) , it becomes a trivial addition operation to 
adjust any redshifted template to a new luminos- 
ity distance /j. e . 

2.3. Host Galaxy Extinction 

Dust extinction in a SN host galaxy can mimic 
the dimming of a light curve due to distance. This 
adds additional scatter - or possibly a systematic 
bias - to any relations derived from SN distances. 
Given that one primary goal of our SN light curve 
analysis program is to enable cosmology with pho- 
tometry, BATM and SOFT must be able to dis- 
entangle these very similar effects from analysis of 
the light curve shape alone. 

For an arbitrary bandpass X, the host galaxy 
extinction is given by Ax = (Ax/ Ay) * Ay mag- 
nitudes. The visual extinction Ay is a free param- 
eter in our models, so we need to determine the 
ratio Ax /Ay for each bandpass of interest in order 
to synthetically redden our template spline curves 
to match observations. As described above, each 
template light curve is constructed from a collec- 
tion of spectra that have been appropriately red- 
shifted and warped to match the template's pho- 
tometric light curve. As the template library is 
being built, we integrate each of these redshifted 
template spectra over the bandpass X to get an 
unextinguished magnitude mo . We then apply the 
Ry=3.1 extinction law of ICardelli et al.| ( |1989[ ) 



to the spectrum and repeat the spectral integra- 
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tion to get a reddened magnitude m re d- The 
reddening for our bandpass X is now denned as 
Ax = {mred — m o)- This procedure is repeated 
for several input values of Ay to get a sampling 
of Ax (Ay), from which we can compute the slope 
Ax I ' Ay ■ Each light curve in the template library 
then carries with it a measurement of the Ax I Ay 
ratio, allowing for Ax to be computed very effi- 
ciently for any physically appropriate value of Ay. 



Nugent et al. (20021 and Jha et al. (2007) have 



measured the time dependence of extinction in 
Type la SN light curves, demonstrating that Ax 
can vary by a few percent over the first few months 
after explosion (reaching as much as 5% in the 
I band at early times). This low-level reddening 
variation should be included in our handling of 
the SN templates, but further work is needed to 
quantify this effect on CC SN light curves. For the 
present work, we use the median Ax value at all 
times. 

An additional concern is whether the ratio of 
total to selective extinction, Ry, may have sub- 
stantial variability from galaxy to galaxy or SN to 
SN. When the extinction due to dust in the host 
galaxy of a SN is low, we can safely ignore the ef- 
fects of a non-standard extinction law. However, 
any highly reddened SNe with a different host Ry 
will produce systematically biased parameter es- 
timates. This can easily be addressed within the 
B ATM/SOFT framework by allowing Ry to be a 
free parameter (which can be marginalized over). 
To simplify the Monte Carlo testing presented in 
this paper, all of our simulated light curves are 
generated with a fixed extinction law that is set 
to the Milky Way value of Ry=3.1 

Furthermore, the extinction law for SNe may 
change at higher redshift due to increased rates of 
star formation, and it is possible to account for 
this effect in the choice of prior ( |Holwerda|[2008| . 
Additionally, selection effects lead to lower detec- 
tion rates of highly extinguished SNe at high z. 
Accordingly, the Ay prior should change with z, 
although in our verification tests in £j5]we employ 
a single Ay prior at all redshifts. For testing with 
synthetic SNe, the Ay distribution can be dictated 
in the data simulation so the use of an Ay prior 
that is constant with z is appropriate. A more 
complex Ay prior would be preferable when ap- 
plying these methods to high-redshift SN surveys. 



3. Bayesian Probabilities 

To develop the probabilistic framework of the 
BATM method, let us start with a candidate ob- 
ject called "SNX." Suppose we have N data points 
in the light curve of SN X for which we have mea- 
sured the time t, flux /, and flux uncertainty a, so 
that the i data point is given by Dj = (i i; f t , a^). 
Each model Mj at each location 6 gives a single 
deterministic prediction for the observable flux as 
a function of time: J~j(t, 9). To begin with, we as- 
sume that there is no uncertainty inherent to the 
model (we will reexamine this assumption in fQ. 
Thus, if the physical model Mj and location pa- 
rameters 6 are correct, then the observed fluxes of 
our candidate SN X should deviate from the model 
only by random observational errors. We assume 
those errors have a Gaussian distribution, so the 
probability of observing the \ th data point is just 
the probability of the error term: 



1-KO. 



exp 



2a? 



(4) 

If the errors are independent, then we can com- 
pute the likelihoocj^Jof a match between SN X and 
model Mi at location as: 



P CD\9,M J ) = l[p(D l \e,M J 



(5) 



To determine the net likelihood of model Mj, 
we multiply Equation [5] by p(6\Mj) - the prior 
probability distribution for location parameters 



(see £ 3.1 ) - and integrate over all possible vectors 



0: 



p(D\Mj) = J p{e\M j )p(D\0,M j ) d6 



(6) 



If we have only one model available, or if we 
assume that the model Ma is correct, then we can 



9 Note that this is in fact a likelihood and not yet a true 
probability, in the sense that the integral of Eq. [5] over all 
models Mj and all locations will not be equal to unity. 
Thus, p(D\Mj , 0) is merely a dimensionless function that 
quantifies the likelihood that the data set D could be ob- 
served under the assumption that Mj and are a perfect 
model for SNX. 
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use Bayes' Theorem to define the posterior prob- 
ability as a function of location 9: 



P{0\D,M S ) 



p(e\M 3 ) P (D\e,M 3 ) 

p(p\Mj) 



(7) 



In our case, however, we are considering a list of 
many applicable models. To evaluate the success 
of each model in reproducing the data, we must 
repeat the calculation of Eq. [6] for all available 
models Mj. We can then apply Bayes' Theorem 
to define the Posterior Model Probability (PMP), 
or the "model weight," as: 



PMPj = p(Mf|D) = 



p{M 3 )p(T>\M ) 
E p(M k )p(D\M k ) 



(8) 



Each PMP is a scalar value between and 1 
that indicates the degree to which a given model 
Mj is preferred by the data D, relative to all other 
models used. The PMPs from all models can be 
combined with posterior probabilities from Eq. [7] 
to get model- weighted parameter estimates, using 
the method of Bayesian Model Averaging ( |Hoeting] 
et al.|1999 1 . The problem of parameter estimation 
is addressed in more detail in a companion paper 
(Rodney and Tonry, 2009), while for the remainder 
of this discussion we restrict ourselves to the task 
of model selection. 



p(Mj) 



K 



class -^disc 



(9) 



Nclass 

The first factor, lZ c i ass is the rate of SNe from the 
sub-class containing model Mj, as derived from 
past surveys (we have used the compilations of 



Blanc & Greggio (20081 and Cappellaro et al 



( 1999 )). This intrinsic rate is then scaled by F<n. 



the fraction of objects from this class that would 
be detected by our survey, which we derive us- 
ing Monte Carlo simulations of survey operations. 
Finally, we divide by a dilution factor, N c i ass , to 
account for the presence of multiple discrete mod- 
els representing the same subclass (George 1999). 
Suppose the prior probability for the II-P class as 
a whole is TZiipFdi SC — 0.2, and we have 3 tem- 
plates in that subclass. If we assign each of the 
II-P templates a model prior of p(Mj) = 0.2, then 
the total prior weight in the II-P class is artifi- 
cially inflated to 0.6. Thus, to avoid introducing 
a bias due to having different numbers of models, 
we divide by the number of templates within each 

Class, Ndass- 

3.1.2. Location Priors 

If we assume that the four location parameters 
9=(z,fi e ,Ay ,t pk ) are uncorrelated, then we can re- 
cast the location prior as a product of constituent 
parts: 



3.1. Priors 

As in all Bayesian applications, the choice of 
prior probabilities must be undertaken with some 
care, to avoid introducing unintended biases. For 
Equations [6] and [8] we need to define two types of 
priors: (1) a scalar prior for each model Mj, based 
on the expected rate of SNe in each class; and (2) 
a prior probability distribution over 9, reflecting 
our initial understanding of the allocation of SNe 
throughout the universe. 

3.1.1. Model Priors 

The model prior p(Mj) should describe the a 
priori probability that a supernova matching the 
model Mj could be discovered by our survey, re- 
gardless of location 9. We have set the model pri- 
ors for each SN sub-class in Table [l] according to 
the following formula: 



P (9\M 3 ) = p{z\Mj)p{p: e \Mj)p{k v \Mj)p{t pk ) 

(10) 

Note that the t pk prior is independent of the model 
choice, because the calendar date of a SN explosion 
is not influenced by the type of SN event. All other 
location priors could be dependent on the model 
or class of models under consideration (e.g. The 
prior for host galaxy extinction might be different 
for CC vs TNSN). Whenever possible, these pri- 
ors should be informed by spectroscopy and host 
galaxy photometry. In the absence of such infor- 
mation, a uniform prior may be applied. In all 
cases it is important to scale each prior probabil- 
ity distribution so that it integrates to unity over 
the range of values that BATM has access to. 

In the verification tests presented in ^5] we 
have employed a uniform prior for for the redshift 
p{z\Mj), the cosmological dimming p(p e \Mj), and 
the time of light curve peak p(t pk ). For the host 
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galaxy extinction prior p(Ay|M 7 ) we have used 
the "galactic line-of-sight" (glos) prior employed 
by the ESSENCE team QWood-Vasey et al.|2007| ), 
which is based on the host galaxy extinction mod- 



els of Hatano et al. (19981, Commins (20041, and 



Ricllo & Patat ( 2005 1 . The form of this prior is an 
exponential plus a truncated Gaussian, expressing 
the preference for matches that select a low Ay 
value. 

3.2. Classification 

To extract a classification probability with 
BATM is now straightforward. We simply com- 
pute the PMPs for all models, then sum together 
the PMPs from all models belonging to the class 
of interest. For example, to determine the proba- 
bility that an object is of Type la: 



p(Ia\D)= PMP k 



(11) 



k<£{Ia} 



where the sum is over just those templates in the 
Type la subset, {la}. The classification probabil- 
ity for any other class or subclass can be found in 
the same way, and these normalized classification 
probabilities are related through simple additive 
statements: 



p(TN\D) = p(Ia+\D) + p{Ia\D) + p(Ia~\D) 

p(CC\D)= p(Ibc\D)+p(IIb\D)+ 

p{IIP\D) + p(IIL\D) + p(IIn\D) 

p(TN\D)+p(CC\D) = 1 

4. Fuzzy Models 

In the preceding section we laid out the BATM 
framework for probabilistic comparison between 
SN light curve data and template-based models. 
The alternative Fuzzy Set-based method described 
here (SOFT) is motivated by a careful consider- 
ation of the assumptions required by BATM. In 
order to apply the tools of probability theory in 
BATM, we invoked three key assumptions: 

1. Each SN model Mj provides an exact deter- 
ministic prediction for the flux as a function 
of time. 



2. The set of all models {M} is complete : all 
possible light curve shapes are represented. 

3. The set {M} is non-redundant : a candidate 
SN cannot be a true match to more than one 
M r 

These assumptions would be justifiable if we were 
using a parameterized SN model that ostensibly 
covers the entire hidden range of physical param- 
eters However, the premise breaks down when 
we use a discrete set of templates with fixed light 
curve shapes. 

In the language of set theory, we can define a 
universal set U that contains all possible SNe with 
any physical parameters 3> and any possible loca- 
tion 6. Each of our discrete set of models Mj 
defines a subset Sj C U containing all possible 
SNe that have precisely the physical parameters 
observed at any location 9. These subsets are 
classical or "crisp" sets, in that membership in the 
set is a binary condition: either you are in or you 
are out. This is in keeping with the determinism 
of assumption (1) above, which was used to de- 
rive Eq.s [4][5j From this we can immediately see 
that any discrete and finite set of models cannot 
satisfy both assumptions (1) and (2): if the mod- 
els are presumed to be deterministic, then there 
will always be some subset Sx C U that is not 
represented. 

How does this contradiction affect our classifi- 
cation estimates? Suppose we have an extremely 
sparse template library, consisting of only two 
template models, Mi and M 2 , which have under- 
lying physical parameters 3>i and $2(^3 Now let 
us observe a candidate object, SN X, which has 
physical parameters &x that are intermediate be- 
tween $1 and $2 (e.g., perhaps the candidate has 
a 56 Ni mass that is bracketed by the two tem- 
plates: M Nl ^ < M Ni ^ x < Mma)- SN X is not a 
member of either of the crisp subsets Si and S 2 
defined by our two models, so the classification 
likelihood of Eq. [6] will be very small for both Mi 
and M 2 . If the data quality is high (<Ji — * 0), then 
the computed likelihood of a match can become 
vanishingly small, and we will be unable to com- 
pute a meaningful classification probability. This 
is precisely the opposite of what we expect from a 



10 This reduction is only used for convenience. The conclu- 
sions drawn are equally valid for any finite set of models. 
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respectable SN classification algorithm: high sig- 
nal to noise observations should lead to more pre- 
cise classifications, not more classification failures. 

The problem we are exposing is a fundamental 
limitation of a template-based Bayesian classifier: 
one cannot reliably classify SNe that fall into the 
gaps between discrete templates when constrained 
by our three core assumptions. 

4.1. Defining Fuzzy Sets 

One possible means of addressing this problem 
of discrete models is presented by the field of logi- 



cal analysis known as "Fuzzy Set Theory" (Zadch 



1965). To use this mathematical framework, we 



must first adjust our models so that they define 
"fuzzy sets," whose membership is not crisply de- 
fined. Some examples of fuzzy sets in the realm of 
astronomy are categories such as "high mass stars" 
or "rich clusters." The semantic meaning of these 
groups is clear enough, but determining whether 
a certain object belongs within the set is some- 
what ambiguous. A 100 M star certainly belongs 
within the "high mass" group, and a 0.5 M star 
does not, but does the membership line get drawn 
at 2, 5 or 10 M ? For our purposes, we want to 
explicitly introduce some fuzziness into the defini- 
tion of our models, so as to ensure that any can- 
didate SN X has some degree of membership in at 
least one of the subsets Sj. As discussed below, 
we will need a new method for combining the ev- 
idence from multiple fuzzy models when deriving 
a composite parameter estimate. 

Membership in a fuzzy set A is defined by a 
membership function g(a;|j4) which takes any pos- 
sible set member x and assigns it a "membership 
grade," which is a real number in the range [0,1]. 
For crisp sets, the membership function is binary, 
always providing = 1 for set members, or 

g(x\A) = for non-members. For fuzzy sets, a 
membership grade close to unity indicates a strong 
association with the set, while g(x\A) ~ suggests 
the opposite. 

To see how we might define a membership func- 
tion for fuzzy SN sets, let us consider the family 
of template light curves derived from SN 1998aq, 
as shown in Figure [3] The actual supernova event 
1998aq had some unknown set of particular phys- 
ical parameters $gs a9 . The model we constructed 
from the 1998aq light curve template allows us to 



produce the precise light curves of Figure [3] which 
predict how a SN with identical physical parame- 
ters 3>98a<j would appear if it were observed at a 
different location 9. Suppose we introduce a small 
perturbation of the physical parameters, (5<fr, per- 
haps by adding a little more 56 Ni or allowing for 
a bit more mixing in the ejecta. This makes a 
slightly different explosion, with physical charac- 
teristics now given by 



98aq 



98a<7 



<5* 



Recalling the underlying assumption that changes 
in the physical parameters will be reflected in 
changes of the light curve shape, we can assume 
that the small physical perturbation 5& results in 
a small change to the observable flux: 

f(X,t) = f 98aq (X,t)+6f(X,t) 

This new explosion would fall outside of the crisp 
set defined by the SN 98aq model, but we can in- 
clude it within an expanded fuzzy set that is cen- 
tered on SN 98aq. 

What form should the membership function for 
this fuzzy set take? In words, the membership 
function will define the fuzzy set of "All SNe with 
light curves similar to Mj at location <?." To quan- 
tify similarity in this statement, we introduce a'j as 
a "model fuzziness" term. The a'j term has units 
of flux, and it defines the width of a Gaussian dis- 
tribution around the model Mj that indicates how 
much flux deviation from the core model is allowed 
for members of this fuzzy setj^J Consider our can- 
didate SN X, and for the moment let us ignore 
observational errors (cr^ ~ for all N data points). 
If we assume the fuzzy model Mj and location 6 to 
be correct, then the probability of getting the ob- 
served flux discrepancies between the model and 
the data due to model uncertainty alone is: 



-{fi-Ti{U,6)Y 



N 1 / 

(12) 

This mimics Eq. [5] which provided the likeli- 
hood of observing a light curve D based on obser- 



2 < 



11 We have chosen the Gaussian function for convenience. 
Another form (such as a Lorentzian or a Voigt Profile) could 
be equally appropriate. 
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vational errors only. To combine these two prob- 
ability estimates we convolve Eq. [5] with Eq. [12"| 
(e.g. |Gregory||2005| Sec.4.8) to get: 



p"(T>\9,Mj) = 



N 

n 



x exp 



2 (<r? 



(13) 



We can then modify this likelihood to account 
for our priors on 9 and Mj , which produces a final 
likelihood estimate suitable for use as the mem- 
bership function of a fuzzy set: 



g(T>\9, Mj) = p(e\Mj) p{Mj) p"(D|0, Mj) (14) 

This, then, is the membership function for our can- 
didate SN X in the fuzzy set defined by (9, Mj). 
This fuzzy set contains all possible SNe that are 
similar to the fuzzy model Mj and are also at the 
precise location 9. 

These membership functions are very similar to 
the probabilities used in Sj3] but there arc some im- 
portant distinctions. First, consider that in clas- 
sical probability theory when one assumes a given 
model Mj, it is then required that the posterior 
probability over 9 must integrate to unity: 



/ 



p(6\D,Mj) d9 



This requirement is enforced by the presence of the 
normalization factor in Eq. [7J It is not necessarily 
appropriate to apply the same normalization to 
these fuzzy membership functions. 

Furthermore, when we apply Bayes' Theorem 
to compute the PMP in Eq. [HJ the denominator 
ensures that the sum of all PMPs is equal to unity: 

Y^PMPj = J2p(Mj\B) = l 

3 3 

This normalization is not a feature of the fuzzy set 
membership functions: 



J 9(T>\d,Mj)d9^1 



The reason for this difference is that fuzzy sets 
can overlap, and therefore they do not satisfy the 



Kolmogorov axioms^] In our fuzzy sets a candi- 
date SN can be similar to template Mi while also 
being similar to M 2 . Thus, any given candidate 
can have non-zero membership grades in multiple 
fuzzy sets, and the sum of membership grades can 
be greater than or less than unity. 

4.2. Combining Fuzzy Sets 

By comparing the light curve of our candidate 
SN X against all models Mj, we can derive mem- 
bership grades g(T>x\9, Mj) for all possible val- 
ues of 9. We would now like to combine the re- 
sults from all the TNSN model comparisons to 
get a composite membership grade for the fuzzy 
set containing all TN SNe. After also computing a 
composite fuzzy membership grade for the CC SN 
class, we could compare the relative strengths of 
membership to determine a classification. How do 
we go about combining multiple fuzzy set mem- 



bership functions from Eq. 14? 



Zadeh ( 1965 1 in his seminal paper on fuzzy 
set theory proposed the following rules defining 
the "fuzzy union" and "fuzzy intersection" of two 
fuzzy sets: 



g(X\A UB) = max [g(X\A), g(X\B)} 
g(X\Ar\B)=mm [g(X\A),g(X\B)} 



(15) 



Zadeh's combination rules are illustrated graph- 
ically in the left two panels of Figure [4] |Bell-| 
man & Giertz ( 1973 1 provided an axiomatic frame- 



work for fuzzy set operators that justifies Zadeh's 
min/max rules as the purest possible operators. 
That is, the max function provides the most inclu- 
sive fuzzy OR, while the min function is the most 
exclusive AND operator. However, the min/max 
functions are not the only allowable operators. 
Several classes of functions can satisfy the re- 
quired axioms while also allowing some varia- 
tion in the level of inclusivcncss or exclusivencss 



(e.g. 


Hamacher|1978 


Yagerl980 


Dubois & Prade 


1980b 


|. These operators are called interactive be- 



cause rather than choosing just one of the mem- 
bership grades g(X\A) or g(X\B) as the min/max 
operators do, these functions allow both the A and 



12 Specifically, the 2nd and 3rd Kolmogorov axioms, stating 
that J^V P{^i) = 1 for a complete set of elementary events 
d, and that P(A U B U C) = P(A) + P(B) + P(C). 



12 



1.0 
0.8 
0.6 
0.4 
0.2 

■5 o-o 



1.0 
0.8 
0.6 
0.4 
0.2 
0.0 



3 Fuzzy Set 
Membership 


-unctions 


Sum/Produ 
} 

— 


ct 


i j/ 




q=oo (min/max) 


q = 0.2 





erators that is general enough to encompass the 



0.2 0.4 0.6 0.8 0.2 0.4 0.6 0.1 

e 



Fig. 4. — Demonstration of combination rules for 
fuzzy sets. In the upper left panel are shown three 
hypothetical fuzzy set membership functions g±, 
g2, and 93. The remaining three panels show dif- 
ferent options for the combination of these fuzzy 
sets. In each case, the aggregate fuzzy set cre- 
ated by applying the conjunction operator (OR) is 
shown as a (green) dashed line: gi U <?2 U gz- The 
disjunction operator (AND) appears as a filled 
curve (blue), showing g\ n 52 H gz These combined 
sets are displayed with arbitrary normalization for 
clarity. Upper Right: If the membership func- 
tions were treated as classical probability func- 
tions, then one would use the Sum operator for 
OR, with the Product operator for AND. Lower 
Left: Applying the Dombi functions for fuzzy set 
operators with q = 00, we recover a special case 
where OR is the max function, and the min func- 
tion serves as the AND operator. Lower Right: 
A small value of q — 0.2 emphasizes agreement 
between fuzzy sets. The resulting union and in- 
tersection sets are less sharply peaked, and less 
strongly influenced by outliers. 

B functions to contribute simultaneously. For fur- 
ther discussion of the axiomatic requirements of 
fuzzy set operators and comparisons of some inter- 



active classes, see section II. 1 of Dubois fc Prade 



(1980a I and Chapter 2 of Klir fc Folger (19881 



operators of 


Zadeh 


(1965 


), 


Hamache 


Yager 


(1980 


as special cases. Let a 



and b repre- 
sent the membership grades of our candidate ob- 
ject X for the fuzzy sets A and B, respectively. 
Then the Dombi membership function for X in 
the fuzzy union set A U B is: 



g(X\AUB)=— x - — u- rifi >, 

and for the fuzzy intersection An B Dombi has: 



g(X\AHB) 



i+ia-ir+G-ir 



(17) 



Dombi ( 1982 1 derived a family of fuzzy set op- 



where the parameter q is in the range (0, oo). Fig- 
ure [3] shows the shapes of union and intersection 
sets resulting from a combination of three fuzzy 
set membership functions. The q parameter gov- 
erns the strength of interaction between the two 
membership function arguments A and B. In the 
limit where q— > oo the Dombi union function re- 
duces to Zadeh's max operator, while the Dombi 
intersection becomes Zadeh's min function. As the 
value of q decreases, these fuzzy operators become 
progressively "softer," allowing more and more in- 
teraction between the sets. 

4.3. Calibrating Fuzzy SN Models 

With the fuzzy SN models and combination op- 
erators described above, we now have a complete 
mathematical framework for combining fuzzy sets. 
To get here, we have introduced two new param- 
eters: a'j describes the amount of fuzziness in our 
SN models, and q controls the amount of inter- 
action allowed when combining sets. These pa- 
rameters are actually characteristics of the tem- 
plate library, and may be estimated empirically 
by inter-comparison of templates. 

If our template library consisted of just two or 
three models with substantially different physical 
characteristics 4>, then we would need to make 
them very fuzzy (a very large a'j) in order to fill 
the space between models. With a few hundred 
templates the need for fuzziness would be reduced, 
so o~j could be very small. To calibrate the amount 
of fuzziness required for our template library, we 
measured the flux difference between pairs of tem- 
plates belonging to the same SN sub-class, at the 
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same location 8. Taking a median value from all 
the possible pairwise comparisons, we determined 
that setting a'j to be 10% of the template flux (ap- 
proximately 0.1 mag) is an appropriate degree of 
fuzziness for the set of TN SN templates listed in 
Table [TJ For the set of CCSNe, with fewer tem- 
plates and greater heterogeneity, we found that 
slightly more fuzziness is necessary, at the level of 
a'j = 0.15 Fj{t,e). 

As shown in Figure [4] setting q = 0.2 results 
in a fuzzy union (dashed line) that emphasizes 
the overlap between sets. In this sense it is a 
sort of hybrid between the strict interpretations 
of the classical OR and AND operators. Thus, 
the q = 0.2 fuzzy union achieves the goal stated 
at the top of |4.2| in that it indicates how much 
a candidate SN X is similar to any or all of the 
available models. 

A more rigorous empirical calibration of q could 
be done using a training set of SNe with known 
locations 8. After comparing the training set 
against the template library, one would adjust the 
q parameter so that fuzzy model combination pro- 
vides the most accurate estimate of the true lo- 
cation. Such a detailed calibration of q is beyond 
the scope of this work, but we have performed pre- 
liminary tests using TNSN light curves from the 
Sloan Digital Sky Survey (SDSS) ( |Holtzman et al. 
2008). This rough, empirical calibration showed 



that q values in the range (0.1,1.0) provide accept- 
able results with this template library. 

4.4. Fuzzy Classification 

We now have all the necessary tools for execut- 
ing the SOFT method on SN light curves. The 
procedure is as follows: 

1. For each template Mj, at every location 8, 
subtract the model flux Tjit^ff) from the ob- 
served flux of the candidate fi(t). 



2. Using Eq.'s 1~3]|T4 determine the fuzzy set 
membership grade of this candidate over all 
available parameter space. 

3. Collect membership functions into groups 
according to SN sub-class, excluding any 
that are zero everywhere. 

4. Combine the membership grades within each 
sub-class using the fuzzy union operator of 



Eq.[T6] 

The result is a final composite membership 
function for each SN sub-class, g(D\6,Y), giving 
the degree of membership for SN X in sub-class Y 
as a function of location 8. The peak of the distri- 
bution g(D\6,Y) provides an estimate of the true 
location 8, and the width of g(D\0,Y) indicates 
the uncertainty of that estimate (see Paper 2 for a 
complete discussion of parameter estimation with 
fuzzy templates). The integral of g(D\6,Y) over 
8 gives a classification grade for that particular 
sub-class. 

Given the similarity between fuzzy set classifi- 
cation grades and Bayesian posterior model prob- 
abilities, one might be tempted to normalize each 
sub-class grade by the sum over all sub-classes, 
in a manner similar to Eq. [8] In general this 
is not appropriate, because fuzzy sub-classes are 
not mutually exclusive, and therefore don't follow 
the Kolmogorov axiom of additivity: Paubuc — 
Pa + Pb + Pc- Any subclasses that might have 
some membership interaction should be combined 
using Dombi's fuzzy intersection or union opera- 
tors. The one useful exception arises when com- 
paring relative classification grades for the TN and 
CC SN classes. In that case we can assert that 
these two classes represent fully isolated sets, be- 
cause there is a significant physical difference be- 
tween any Type la and any core collapse SN. Thus, 
the TN and CC fuzzy sets can be treated as crisp 
sets and combined additively. 

The normalized membership grade for the 
TN SN is analogous to the Posterior Model Prob- 
ability (PMP) of Eq. m We call this quantity the 
Posterior Membership Grade (PMG), and calcu- 
late it as follows: 



PMGtn = g{TN\T>) 



U 9(P\M t ) 

iGTN 



U ff(D|Af0+ U 9<P\Mj) 

i£TN 3 eCC 

(18) 

where \J #(D|M;) 

i£TN 

indicates the fuzzy union of membership grades 
from all templates in the TNSN class. The 
corresponding classification grade for CC SNe, 
PMGcCi is defined in the same manner. 



14 



5. Verification Tests 

To evaluate the utility of the SOFT template 
matching technique, we first generated a Monte 
Carlo simulation containing 5000 synthetic light 
curves for each of the TN SN and CC SN classes, 
evenly distributed over the sub-classes. Each syn- 
thetic light curves was created using a template 
that was prepared in the way described in fj2] 
To begin, the template was warped and shifted 
to a randomly selected point in the bounded 4- 
dimensional parameter space of 9—(z, fi e , Ay, 
t pk ). We then generated an "observed" griz light 
curve for each synthetic object, using a simula- 
tion modeling 120 nights of the Pan-STARRS 1 
Medium Deep survey. In terms of wavelength 
coverage, cadence, and depth, this simulation is 
roughly intermediate between the SDSS-II Super- 
nova Survey ( |Sako et al. 20081) and planned sur- 
veys from projects such as the Large Synoptic Sur- 
vey Telescope ( |Tyson||2002 1 . 



The location parameters 0=(z,/u e ,Ay ,t pk ) were 
drawn from a uniform distribution over a broad 
range of parameter space: 



0.001 < z < 1.0 
-0.8 < fl e < 0.8 
0.0 <A V < 1.5 
20 < t pk < 100 



By choosing a uniform distribution in z, /i e , and 
Ay, these Monte Carlo simulations are clearly not 
depicting a realistic survey yield, where the rate 
of detection depends on cosmology and the under- 
lying SN explosion rate. The purpose here is to 
determine whether there are regions of parameter 
space where the SOFT methods are poor classi- 
fiers, which could lead to sample selection biases. 

Our second data set for testing classification 
accuracies is a collection of 222 published light 
curves from the SDSS-II and SNLS surveys. The 



SDSS light curves, from Holtzman et al. (20081 



consist of 146 objects that have spectroscopic red- 
shifts ranging from 0.01 to 0.45. All of these ob- 
jects were classified as Type la, although 16 were 
given a "SNIa?" classification, indicating some 
uncertainty in their designation as TN SNe. These 
SNe are well sampled in the Sloan u'g'r'i' z' bands, 



with observations done in all 5 filters on a single 
night, a typical cadence of less than 5 nights, and 
sensitivities of around 22 mags. 

From the SNLS survey we include 71 objects 



from Astier et al. ( 2006 1 with spectroscopic red- 
shifts ranging from 0.25 to 1.0. Fifteen objects 
in this set were given the classification "la*" to 
indicate that the best spectral match is a SN la 
model, but that an alternative classification can- 
not be ruled out. The griz light curves of these 
71 SNe have a similar sampling frequency to the 
SDSS set, with typically 8-15 observational epochs 
over 100 days in three or more filters. Finally, we 
include an additional 5 SNLS light curves from 



Nugent et al. (20061 that are classified as Type 



II-P supernovae. These objects have redshifts be- 
tween 0.1 and 0.3. 

In Figure[5]we show two examples of the real SN 
light curves used in these verification tests. The 
first is SN 2005jb from the SDSS survey, which has 
a spectroscopic redshift measurement of z=0.258. 
The second example is SNLS-03D4cz, at a redshift 
of 0.695, from the SNLS survey. Over-plotted lines 
in this figure show the SOFT template match that 
produces the maximum likelihood for each of these 
light curves. 

5.1. Results 

The classification results from our Monte Carlo 
simulations are shown in Figures [6] and [7] for the 
TNSN and CCSN model templates, respectively. 
The left-hand panel of each figure shows the z- 
fi e plane divided into a regular grid. In each box 
on the grid, we collected all the synthetic light 
curves whose input parameters (z,/i e ) fell within 
that box. There are 400 boxes, so each box con- 
tains roughly a dozen of the 5000 synthetic light 
curves, and each of those dozen light curves has 
a slightly different /i e and z, and perhaps a very 
different set of input values Ay and t pk . Thus, 
each contributing synthetic SN light curve yields 
a different set of classification grades PMGtn and 
PMGtn ■ Combining all contributing light curves 
for a given box, we can compute the average classi- 
fication grade for each of the two principal classes. 
The resulting classification grade image shown in 
Figures [6] and [7] is black in regions where most 
synthetic SNe are classified as TN SNe, and turns 
white in regions where most are found to be CC 
SNe. Another representation of these data is given 
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Fig. 5. — Two examples of the maximum likelihood SOFT template matches for real SN light curves from 
the verification data. Left: SN 2005jb from the SDSS survey, with a measured redshift of z=0.258. The 
best-fitting light curve from the SOFT library is based on the normal Type la SN 1996X. Note that the v! 
band is not shown. Right: SNLS-03D4cz from the SNLS survey. The spectroscopic redshift of this object is 
z=0.695, and the maximum likelihood match from SOFT is based on SN 2000cn. 



in Figure [8] which shows the average classification 
grade as a function of redshift for both the TN 
and CC model sets. 

In Figure [6] the input light curves are based on 
the TN model subset, so a perfectly accurate clas- 
sifier would produce a completely black image, by 
returning a 100% probability that every synthetic 
light curve is a TN SN. The SOFT classifier is 
not perfect, so most of the boxes in the left panel 
have average PMGtn values that are less than 
100%, particularly in the upper right corner, for 
high values of redshift, distance, and extinction. 
In Figure [7J the regions where the input CC mod- 
els are imperfectly classified with some non-zero 
value for PMGtn appear gray instead of white. 
Almost all of this contamination arises from light 
curves based on Type Ib/c models. 

Of the 5000 light curves that contribute to Fig- 
ure [6j SOFT calculates a PMGtn classification 
grade that is greater than the PMGcc grade for 
76% of them. To limit the false positives and false 
negatives from the classification results, we can 
apply a simple cut to remove the most problem- 
atic objects by rejecting light curves with a peak 
signal to noise (S/N) ratio that falls below some 



threshold. Figure [8] shows the mean classification 
grades as a function of redshift for all SNe (dotted 
lines) as well as the classification grades from sub- 
sets with S/N> 3 (dashed) and S/N> 10 (solid). 
After applying the still very modest S/N> 10 cut, 
the fraction of TN SNe that are correctly identified 
by SOFT increases to 98%. For this S/N> 10 sub- 
set, the mean TN SN classification grade is > 90% 
across all redshifts. 

For the synthetic CCSNe, the S/N> 10 cut 
ensures an average PMGcc classification grade 
above 80% across all redshifts. The primary 
source for misclassifications of synthetic CC SN 
light curves is the Type Ib/c objects. In particu- 
lar, the SOFT classifier is often confused by Type 
Ib/c SNe at low redshift that have a very low S /N 
(i.e. those with heavy host galaxy extinction or a 
very large synthetic distance modulus /i e ). 

The results from our verification test of the 
SOFT classifier using 222 real SNe are shown in 
Figure [9] As was done in the Monte Carlo sim- 
ulations, we applied the SOFT classification pro- 
gram to all of these SNe using a uniform redshift 
prior (i.e. we ignore any host galaxy or SN spec- 
troscopy), as well as a uniform prior on /i e (i.e. we 
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Fig. 6. — Results from the synthetic light curve classification tests. Using TN SNe in Table [l] as models, we 
constructed 5000 synthetic light curves spread uniformly over the parameter space. For each light curve we 
used the SOFT method to calculate PMGtn and PMGcci the normalized composite membership grades 
for the TN SN and CC SN fuzzy sets, respectively. We then divide the parameter space of \x e and z into bins 
of size A^ e =0.05 mag and Az=0.05. For each bin we collect all the synthetic light curves that had input 
values of fi e and z drawn from that region of parameter space, and then compute the average membership 
grade returned from that bin. We then set a grayscale value for each bin, with black (red in the electronic 
version) indicating that the average PMGtn for a bin is 100%, and white (blue) indicating PMGcc— 100%. 
In the right panel we have done the same binning and averaging, this time with the y-axis marking Ay- See 
the electronic edition of the Journal for a color version of this figure. 



do not assume any cosmology). For the combined 
set of 217 Type la SNe from SDSS and SNLS, we 
found that 204 of these objects (94%) returned 
classification grades that favor a TNSN classifi- 
cation. The 13 objects that SOFT misclassified 
were all from the SDSS survey, and in most cases 
we can deduce the reason for the error. 

Six of the thirteen misclassified objects were 
given a spectroscopic classification of "SNIa?" by 
Holtzman et al. ( 2008 1 , indicating some classifi- 



cation ambiguity even when their spectroscopic 
information is considered. Two more have in- 



complete light curves, with only one or two ob- 
servation epochs beyond the light curve peak. 
Two of the misclassifications are objects with ex- 
tremely broad light curves. SN 2005mo and the 
SDSS object SN7017 (no IAU designation avail- 
able) both exhibit a decline rate substantially 
slower than any of the objects in our template li- 
brary. Using the SALT2 light curve fitter (Guy 



et al.|2007 l as an independent check, we measured 



values for the XI (stretch) parameter of 3.0 for 
2005mo and 5.0 for 7017. For comparison, these 
values are more than two times larger than the 
slowest-declining Ia+ objects in our template li- 
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Fig. 7. — Classification probabilities, as in Figure [6) In this case we generated 5000 CC SN light curves 
with random parameter vectors using the CC templates from Table [T] and once again computed the nor- 
malized fuzzy set classification grades using SOFT. As before, black (red in the electronic version) indicates 
PMGtn=WQ% and white (blue) indicates PMGcc=100%. See the electronic edition of the Journal for a 
color version of this figure. 



brary (SN 1991T,90N,99gp). Furthermore, these 
SALT2 XI values are larger than 98% of the ob- 
jects in the Cf A3 "constitution set" of Hickcn et al.| 
(2009). Unlike light curve fitters that parameter- 



ize the light curve shape (such as SALT2), the 
SOFT classifier is unable to extrapolate to light 
curves with shapes well beyond the limits of its 
template library. In order to correctly classify all 
of these extremely broad Type la SNe, we would 
need to include some very broad Type la template 
light curves in the library. 

A final outlier is the very peculiar SN 2005gj, 
which SOFT classifies as a Type Iln object. This 
object is an example of the rare sub-class of hy- 
brid Type la/Type Iln objects, exhibiting spectral 
indications of a substantial circumstellar medium 
( |Aldering et al.|2006[ ). Although we included tem- 



plates of some peculiar Type la SNe in the SOFT 
library (SN 2000cx and 2002cx), we do not have 
any representatives of this hybrid "Ha" class. 

6. Other Models 

The use of fuzzy set theory in the SOFT method 
allows us to escape from the assumption that our 
set of SN templates describes all possible light 
curve shapes. There is, however, an additional as- 
sumption that we have not addressed: the SOFT 
method still makes the implicit assumption that 
all objects being classified are already known to 
be SNe. In our verification tests this assumption 
is perfectly valid, because the Monte Carlo simula- 
tion only generates synthetic SN light curves, and 
the published light curves have all been spectro- 
scopically confirmed as real SNe. When selecting 
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Fig. 8. — Mean classification probabilities as a function of redshift. Here we have summed over all fi e 
and Ay values, collapsing the grids from Figs. [6] and [7] The left panel shows classification results for 5000 
synthetic light curves based on TN SN models. The right panel shows the CC SN results. In both panels 
the (orange) circles show the average PMGtn value as a function of z and the (blue) triangles indicate the 
average PMGcc- In each case the dotted lines show mean classification grades from all 5000 synthetic SNe. 
Dashed lines are from a subset of SNe that have a peak signal to noise ratio above 3. The solid lines are 
constructed from SNe with signal to noise above 10. See the electronic edition of the Journal for a color 
version of this figure. 



SN candidates from a real survey this assumption 
is often still justified because the set of all detected 
transients is passed through a series of preliminary 
filters to remove objects that are not likely to be 
SNe. For example, variable stars can mimic a SN 
light curve, but these can be removed by reject- 
ing transients that are coincident with a stellar 
source in static sky images, as well as any tran- 
sients that show clearly periodic brightness fluc- 
tuations. However, even with aggressive rejection 
thresholds, some non-SN objects may slip through 
to a candidate list. Furthermore, if potential SN 
candidates are vigorously rejected before attempt- 
ing classification, then some actual SNe are likely 
to be unintentionally discarded, too. If this pos- 
sibility is not accounted for, then any task which 
requires a complete supernova sample (such as a 
SN rate calculation) will give misleading results. 

The most thorough way to address this problem 
in the BATM/SOFT framework would be to add a 



new model for every possible class of SN impostor. 
In principle, one could construct light curve tem- 
plates for RR Lyrae, Cataclysmic Variables, Ac- 
tive Galactic Nuclei, stellar flares, etc. Each light 
curve template could be adapted with the same 
location parameters 6 that were used for our SN 
templates, and the mechanics of computing prob- 
ability distributions would remain the same from 
there. This solution is computationally complex, 
and is hindered by the great diversity of possible 
transient sources. We do not pursue this template- 
based approach here, but instead we consider how 
two non-SN models may be introduced to catch 
two specific categories of transient contaminants. 

6.1. Zero Flux Model 

A first category of non-SN interlopers that are 
often found in SN candidate sets are objects with a 
very low signal to noise ratio across the light curve. 
These objects may have light curves that are in- 
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Fig. 9. — Classification results for 222 SNe from 
the SDSS and SNLS surveys. The y axis carries 
the normalized SOFT classification grade, from 
Equation |18| The x axis marks the published spec- 
troscopic redshift, from either the SN itself or its 
host galaxy. 146 Type la SNe from the SDSS sur- 
vey are plotted as (orange) circles, the 71 Type 
la SNe from SNLS are shown as (magenta) dia- 
monds, and the (blue) triangles indicate 5 Type 
IIP objects from SNLS (note that two of these IIP 
SNe have almost identical redshift and classifica- 
tion grades, so their triangles are indistinguish- 
able) . 



trinsically very different from SN light curves, 
but they have managed to slip through the pre- 
classification filters because the data quality is 
poor enough to allow some ambiguity. When we 
encounter a low-flux light curve, if we assume that 
it is a SN in order to apply a SOFT classification, 
then we are artificially inflating the SN classifica- 
tion probabilities by ignoring the possibility that 
it is some other type of object. 

To address these contaminants within the 
BATM/SOFT framework, we can introduce a 
"Zero Flux Model", (ZFM) which asserts that 
there is no SN observed, so the true flux of the 
light curve is zero everywhere. The Bayesian like- 
lihood of a candidate match using this model is 



similar to Eq. [5] but without any location param- 
eters 6, and without the model flux term T: 



N 



p(D|ZFA/) = n^^exp 



2 a? 



(19) 



To include this likelihood in the set of PMPs for 
Eq. [8] we must assign a prior p(D\ZFM). Ide- 
ally, this prior should represent an initial guess at 
the fraction of low signal to noise candidates that 
arc false positives. A carefully chosen ZFM prior 
would reflect the details of the data reduction and 
candidate selection pipelines. In practice, it will 
generally be sufficient to apply a rough prior that 
is of the same order as the SN model priors, letting 
the data drive the ZFM probability. 

The ZFM is also useful in providing a measure 
of the confidence that should be given to SN classi- 
fications. The observed light curve of a very faint 
SN can often be matched equally well by many 
light curve templates because the observational er- 
rors are so large. BATM and SOFT would then 
classify the object based primarily on the model 
priors defined in §3.1.1 In that case our classifi- 
cation grades should reflect the fact that the data 
are providing little or no new information. The 
ZFM model achieves this, by inflating the normal- 
ization factor of Eq. [8] whenever the light curve 
signal gets buried by observational noise. 

This effect of the ZFM on classification grades 
is illustrated in Figure |10[ where we present a syn- 
thetic light curve that is entirely made up of ran- 
dom noise. There is no real signal for SOFT to 
fit to, but if the only models available to it are 
SN templates, then SOFT will necessarily classify 
the "object" into one of the available SN classes. 
When the ZFM is included, however, we find a 
significant likelihood that there is no object there 
at all, and the renormalized classification grades 
reflect the ambiguity of this very low-signal event. 

6.2. Something Else Model 

The recent discovery of the peculiar optical 
transient SCP 06F6 with the Hubble Space Tele- 



scope ( jBarbary et al. 2009[ ) provides a prime exam- 
ple of another category of non-SN contaminants. 
This object has a bright, clear light curve that 
is superficially rather similar to a SN, but it is 
unlike any known supernova both photometrically 
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Fig. 10. — A synthetic light curve example 
demonstrating the importance of including a zero 
flux model (ZFM) to weed out over-confidence. 
This "light curve" was generated as pure random 
noise, and was then passed through the SOFT pro- 
gram to measure normalized membership grades 
for classification. If SOFT only considers the 
SN templates then the classification probabilities 
would be p(CC\D) = 0.91 and p(TN\T>) = 0.09. 
The maximum likelihood SN template match in 
this case comes from the Type la SN 1996X, shown 
here as the solid curves. When the ZFM is in- 
cluded, the renormalized classification grades be- 
come p{CC\T>) = 0.67, p(TN\T>) = 0.06, and 
p(ZFM\T>) = 0.27. See the electronic edition of 
the Journal for a color version of this figure. 



and spectroscopically (Gansicke et al. 2009). If a 



transient similar to SCP 06F6 were detected by a 
survey like Pan-STARRS or LSST, its light curve 
would not be initially reviewed by human eyes 
(with thousands of new candidates discovered per 
night, there just aren't enough grad students avail- 
able). After the SCP 06F6-like candidate passes 
through preliminary sifting it would probably be 
labeled as "SN-like" and get passed off to an au- 
tomated classification algorithm such as BATM or 
SOFT. If the classifier only compares this object 
against a set of SN templates, it will necessar- 
ily misclassify the object, no matter how broad 
or deep the template library is. 



To account for the possibility that a candidate 
light curve could be something that resembles a 
SN but is not one of the known SN types, we need 
a flexible model that can fit any conceivable light 
curve reasonably well. For this purpose, we intro- 
duce the "Something Else Model" (SEM), a simple 
non-physical model designed to replicate a non- 
periodic light curve. The SEM consists of a set 
of spline curves that are required to pass through 
a set of 5 magnitude points positioned at regu- 
lar intervals in time. The outer two points are 
positioned at the end-points of the observed light 
curve, and the other three are spaced evenly over 
the interior. The magnitude of each knot is a 
free parameter that is allowed to vary over a range 
Ami from zero to 30 magnitudes, with a uniform 
prior across that region. This span comfortably 
encompasses the range of magnitudes accessible to 
any SN survey. Since this model is not physically 
motivated, we model each bandpass completely in- 
dependently. The posterior likelihood for a given 
vector of spline knots niK is calculated in the same 
manner as Eq. [5] 



p(D\m k ,SEM) = J! 



JY 



/27T(Ti 

v „„„ ( -(.fi-FsEM(ti,m k ))'* 

x exp ^ 

(20) 

To determine the net likelihood for the SEM, 
we must integrate this over all possible spline knot 
vectors m k . 

p(D\SEM) = J p(m k )p(D\m k ,SEM)dm k 

m k 

(21) 

To simplify the computation of this integral, we 
can use a rough maximum likelihood approxima- 
tion. By design, there will always be some set of 
knots for which this model can provide a reason- 
ably good fit to any light curve, so the likelihood 
distribution over the knot space will be sharply 
peaked. In such a regime, we can estimate the in- 
tegral of Eq. [21] as the area of that single peak. 
Let us denote the height of the peak as the max- 
imum likelihood C max = max(p(D|m/j, SEM)), 
and the width of the peak in the plane of param- 
eter mk as 5m k . A first-order approximation for 
the peak area in that plane is C max 8m k . To apply 
a uniform prior over the range Atoj. we multiply 
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by a factor l/Am^, and taking the product over 
all knots in all bandpasses, we get C max ]J 2^T~- 
This estimate can then be included as the numer- 
ator of Equation [8] to compute the SEM posterior 
model probability, p(SEM[D). The components 



of this approximation are illustrated in Figure 11 
The product Y\ , sometimes referred to as the 
"Occam factor," decreases rapidly as the number 
of knots increases, and therefore it effectively pun- 
ishes the SEM model for using many more param- 
eters than the SN template models (for a deriva- 
tion of this maximum likelihood approximation 
and further discussion of the Occam factor, see 
section 3.5 of Gregory ( 2005[ )). 

To get C max we find the maximum likelihood 
spline fit for each bandpass using a downhill sim- 
plex minimization and defining each likelihood 
value with Equation [6j We then estimate Srrik by 
sampling the likelihood distribution in the vicinity 
of the peak to find the full width at half maximum 
(see Fig. 
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The final step to include this SEM in our total 
probability calculation is to define an appropri- 
ate model prior. As with the ZFM, the p(SEM) 
prior should reflect the data processing pipeline by 
providing a low probability when the upstream fil- 
tering of candidates is more stringent. In general, 
the occurrence of true "Something Else" objects 
is expected to be very rare, so a value oip(SEM) 
that is three or more orders of magnitude less than 
the SN model priors would be appropriate. 



In Figure 12 we demonstrate the application of 
th is SEM using th e two-c olor SCP 06F6 light curve 
of Barbary et al. ( |2009[ ). If we try to determine 
the probability that this object is a CC or TN 
SN, then BATM and SOFT measure low values for 
both p{CC) and p{TN). If we leave out the SEM 
comparison, then when these low likelihood values 
are normalized against themselves using Eq.[8] the 
result is a pair of grossly exaggerated probabilities. 
By including the SEM case, BATM and SOFT 
can report with high confidence that this object is 
unlike both the CC and the TN classes. 

The structure of the SEM outlined here is nec- 
essarily very loose. Due to the relatively large 
number of parameters and the wide open param- 
eter space, this model will usually reach a rea- 
sonably high maximum likelihood C ma x- Those 
same flexibility features, however, will then sub- 
stantially reduce the posterior probability through 




value of spline coefficient m k 

Fig. 11. — Estimating the area of the SEM likeli- 
hood distribution along one dimension of parame- 
ter space. The curve represents the posterior prob- 
ability p(D\rnk, SEM). The rectangle shows the 
uniform prior p{nik) = I/Atoj. spanning a wide 
range of possible spline knot values. A down- 
hill simplex minimization determines the value of 
the spline parameter nik that yields the maximum 
likelihood value C ma x- Through numerical sam- 
pling of the likelihood distribution around that 
point, we can estimate the width of the likeli- 
hood peak, Srrik, and get a rough approximation 
for the area under the curve as C max Srrik ■ The 
prior times the posterior probability is then esti- 
mated as the product of C rnax with the Occam 
factor, Srrik / Arrik- This figure adapted from Gre 
gory| ( |2005[ ). 



the Occam factor, because the characteristic width 
Srrik is generally much smaller than the parameter 
range Arrik- The size of the SEM prior, the al- 
lowed parameter range Arrik , the number of spline 
knots per filter and their spacing in time are all im- 
portant components in determining the posterior 
probability p(SEM\D). These values should be 
tuned for any particular survey by fitting the SEM 
to known SN transients as well as examples of pos- 
sible non-SN contaminants. A suitably optimized 
SEM will yield a relatively low p(SEM\D) for real 
SNe, while still providing a non-zero p{SEM\D) 
for unusual interlopers. 
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Conclusions and Future Work 
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Fig. 12. — Light curve fits for the unusual tran- 



sient object SCP 06F6. The data points from Bar- 
bary et al. ( |2009| ) are shown as circles for the the 
HST F775W (Y) band, and as squares for F850LP 
(z 1 ). Top: The best fitting template is the Type 
Iln SN 1994Y, at a redshift of 0.2, but even this 
model provides a very poor fit. Bottom: The SEM 
is capable of providing a very good fit to these 
data. The diamond symbols mark the locations 
of the spline knots that define the SEM fit. In 
this case the likelihood of a match for the SEM is 
larger than that from any of the SN templates. 



We have presented in detail the updated BATM 
(Bayesian Adaptive Template Matching) program, 
and showed that a strictly probabilistic framework 
is insufficient for SN classification using fixed- 
shape templates. To overcome the practical and 
theoretical shortcomings of the BATM approach 
we have introduced SOFT, a method based on 
Fuzzy Set Theory. This technique allows us to 
classify SNe into the broad categories of thermonu- 
clear (TN) and core-collapse (CC) SNe using pho- 
tometry alone. 

The SOFT method is demonstrated on simu- 
lated observations of synthetic SNe modeled af- 
ter the Pan-STARRS 1 (PS1) survey. Sampling 
a wide range of redshift, distance and extinction 
we find that SOFT can produce average classifica- 
tion accuracies above 90% for TNSNe and above 
80% for CC SNe. Additional verification of the 
SOFT classification accuracy is presented with 
146 TN SN light curves from the SDSS-II survey, 
which again yield better than 90% classification 
accuracy. Almost all of the SDSS SN misclassi- 
fications could be influenced by peculiarities and 
ambiguities in the light curves themselves. 

We suggest two additional non-SN models that 
can be helpful in identifying anomalous objects 
within a SN candidate set. The Zero Flux Model 
catches SN impostors that have a very low signal 
to noise ratio, and the Something Else Model pro- 
vides a flexible, non-physical model to assist in the 
identification of unusual objects like the peculiar 
transient SCP 06F6. 

A number of further refinements to the SOFT 
program could improve its functionality and ac- 
curacy for automated SN classification. The 
strength of any template matching software is 
limited by the size and quality of the template 
library it draws from. The set of templates used 
for this work has been much enhanced in this 
regard since the earliest incarnation of BATM, 
but there is much room for further development. 
In particular, collecting template light curves with 
more complete UBVRI data - especially in the rise 
time of the light curve - would allow for more ac- 
curate spline fits to define the primary templates. 
This is particularly important for the CC SNe, 
for which the available collection of well-sampled 
light curves and early-time spectra is notably thin. 
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Adding to the template library some light curves 
observed at high redshift could provide a useful 
supplement to the scarce UV data from the low-z 
sample. 

The SOFT method introduces two new param- 
eters that characterize the template library itself: 
Oj quantifies the amount of "fuzziness" and the q 
parameter controls the strength of the Fuzzy com- 
bination operator. These parameters reflect the 
number and diversity of templates in the template 
library, and appropriate values can be estimated 
from a comparison of templates against each other. 
We performed a preliminary calibration of these 
parameters and settled on ct-tn — 0.1 for TNSN 
templates, acc = 0.15 for CCSN templates, and 
q=0.2 for all combinations. A more precise deter- 
mination of a'j and q using a large training set of 
well-observed SNe would be a useful next step. 

In addition to classifying SNe, the SOFT 
method can be used to determine parameter esti- 
mates for candidate SNe, based on the comparison 
of light curves. This extension of the SOFT tech- 
nique is presented in a companion paper (Rodney 
and Tonry, 2009), where we focus on the use of 
SOFT to derive redshift and distance estimates 
from TN SNe for constraining cosmological mod- 
els. 
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